Optical spot grid array scanning system

ABSTRACT

A method for scanning a surface, consisting of focusing an array of optical beams using optics having an axis, so as to illuminate a region of the surface intercepted by the axis, such that each optical beam illuminates a portion of a respective sub-region within the region. The method further includes moving at least one of the array and the surface so as to cause a translation of the surface relative to the axis in a first direction. During the translation in the first direction, each of the optical beams is scanned over the respective sub-region in a second direction, which is different from the first direction.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority from and is a Continuation in Part of U.S. patent application Ser. No. 11/334,963 entitled “Spot Grid Array Scanning System”, filed Jan. 19, 2006, and claims the priority benefit of U.S. Provisional Patent Application 60/647,970, filed Jan. 26, 2005, each of which prior applications is incorporated herein by reference.

FIELD OF THE INVENTION

The present invention relates generally to semiconductor wafer fabrication, and specifically to inspection and/or production of the wafer, and/or of elements used for the wafer fabrication.

BACKGROUND OF THE INVENTION

In the fabrication of a semiconductor wafer, as well as in the production of elements, such as reticles, used in the fabrication, there are typically a large number of steps. Each of these steps is time-consuming, and for efficient fabrication it is important to minimize the time taken while maintaining the quality of the steps. For example, in inspecting the wafer during or at conclusion of the fabrication, a number of methods for minimizing the inspection time are known in the art.

Inspection may be performed by scanning the wafer with an electromagnetic wave beam, and observing properties of the beam after it has interacted with the wafer, typically with the wafer surface. Typically the beam is used to generate an image of the wafer surface. The scanning and/or imaging inspection time may be reduced by using multiple inspection beams, each beam having generally similar properties. The beams are focused onto the wafer surface, and are scanned across the surface in tandem. Using a multiplicity of inspection beams produces a corresponding saving in time to scan and/or image the wafer surface, compared to the time taken if only a single beam is used.

PCT Application WO 03/040709, to Almogy, et al., whose disclosure is incorporated herein by reference, describes an optical imaging system which uses a plurality of optical beams from a spot grid array, the beams being focused onto the surface of a wafer. The wafer is moved so that the focused spots continuously and linearly traverse the surface. The spots are offset relative to each other so that the traversed lines followed by adjacent spots do not overlap, but do touch each other. One scan of the array thus images a relatively large swathe of the wafer.

SUMMARY OF THE INVENTION

In an embodiment of the present invention, an array of optical beams is focused to an array of respective spots on a surface, typically for the purpose of inspection and/or lithography of the surface. The surface is moved linearly in its own plane, and the array of spots is moved in the plane in a discontinuous manner so that the spots remain within an optimal focusing region of the plane. The motions of the array and of the surface are set so that the array of spots illuminate a swathe of the surface, parallel to the linear motion of the surface, while continuing to be focused onto the surface within the optimal focusing region. The motions are also set so that substantially every point within the swathe is illuminated by at least one of the spots. All of the swathe may thus be illuminated in an efficient manner.

In one embodiment, the surface moves with a constant velocity, and the array of spots is moved with a combination of the constant velocity and a scan, typically a raster scan at right angles to the constant velocity, so that each spot illuminates an approximately equal-sized rectangular block of the surface. The set of blocks are contiguous and completely cover an initial region of the surface. Once the set of blocks has been scanned, the array constant velocity is halted, and the array of spots is repositioned in a direction opposite to the direction of the surface constant velocity. The array then moves with the combination motion to illuminate a new region of the surface, with a new set of blocks. The size of the repositioning is set so that the new region scanned is contiguous with the initial region. Alternatively, there may be a small region of overlap between the initial region and the new region.

The repositioning is performed substantially instantaneously, and the process of repositioning is herein also termed jumping. The process of scanning and jumping continues until the swathe is completely scanned. The combination of scanning then jumping in a direction opposite to the velocity of the surface ensures that the spots are maintained within the optimal focusing region.

In an alternative embodiment, the surface moves with the constant velocity, and the array of spots is moved with a combination of the constant velocity and scan at right angles to the constant velocity, so that each spot illuminates an approximately equal-sized rectangular sub-block of the surface. The sub-blocks in a first set of sub-blocks are not contiguous and incompletely cover the initial region. Once a first set of sub-blocks has been scanned, the array of spots makes a “sub-jump,” in a direction having a component opposite to the direction of the surface constant velocity, and scans a second, different, set of sub-blocks. The process of “sub-jumping” and scanning of sets of sub-blocks is repeated, and the size and direction of the sub-jumps are chosen so that the sets of sub-blocks become contiguous and completely fill the swathe. Using sub-jumps of a relatively small magnitude allows the array of spots to be close in size to the size of the optimal focusing region.

In some embodiments, the surface is a semiconductor wafer surface having a multiplicity of dies. The size of the array of spots may be set, by changing the magnification of a system focusing the beams to the spots, so that corresponding points of dies, lying within the swathe being scanned, are illuminated by the same spots. Alternatively or additionally, for embodiments using sub-jumps, the size of the component of the sub-jump opposite to the direction of the surface constant velocity may be varied, typically between two values. Such a variation may be chosen so that corresponding points of the dies are illuminated by the same spots, so that scanning results, such as die-to-die comparisons, are facilitated.

The present invention will be more fully understood from the following detailed description of the embodiments thereof, taken together with the drawings, a brief description of which follows.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1A is a schematic illustration of inspection apparatus, according to an embodiment of the present invention;

FIG. 1B is an enlargement of a section of FIG. 1A, according to an embodiment of the present invention;

FIGS. 2A and 2B are schematic diagrams illustrating focused spots in a region being scanned, according to an embodiment of the present invention;

FIG. 3 shows a snapshot after a number of raster scans have been performed by the apparatus of FIG. 1A or 1B, according to an embodiment of the present invention;

FIG. 4 is a schematic diagram illustrating focused spots in a region being scanned, according to an alternative embodiment of the present invention;

FIG. 5 is a diagram showing a snapshot of cells scanned by an array of the apparatus of FIG. 1A or 1B, according to the alternative embodiment of the present invention;

FIG. 6 is a diagram showing parameters of scanned rectangles after a number of scans of the apparatus of FIG. 1A or 1B have been made, according to the alternative embodiment of the present invention;

FIG. 7 is a diagram showing a relation between a scanned field of view and dimensions of dies being scanned, according to an embodiment of the present invention;

FIG. 8 is a diagram illustrating a method for maintaining die periodicity, according to an embodiment of the present invention;

FIG. 9 is a schematic illustration of an inspection apparatus, according to an alternative embodiment of the present invention;

FIG. 10 is a schematic illustration of an inspection apparatus, according to a further alternative embodiment of the present invention;

FIG. 11 is a schematic illustration of an inspection apparatus, according to a disclosed embodiment of the present invention; and

FIG. 12 is a schematic illustration of an inspection apparatus, according to a further disclosed embodiment of the present invention.

DETAILED DESCRIPTION OF EMBODIMENTS

Embodiments of the present invention focus an array of optical beams to an array of respective spots on a surface, and relate to methods for scanning the spots over the surface. Substantially any type of optical beam may be used. Thus the beams may be from substantially any relevant part of the electromagnetic spectrum, such as the visible, near-visible, or deep ultraviolet (DUV) regions of the spectrum.

Reference is now made to FIG. 1A, which is a schematic illustration of an optical inspection apparatus 10, according to an embodiment of the present invention. Apparatus 10 has a light source 12, typically a laser source, which is divided by a beam multiplexer 14 into an array 16 of generally parallel beams. Typically, a cross-section 18 of array 16 is symmetrical. Herein, by way of example, cross-section 18 is assumed to have six-fold symmetry, i.e., nearest neighbors of elements of the cross-section are at the vertices of equilateral triangles. However, it will be understood that cross-section 18 may have any other convenient form of symmetry, such as two-, three-, or four-fold symmetry. It will also be understood that any other convenient system, such as a multiplicity of generally similar beam sources, may be used to generate array 16, and that all such systems are assumed to be comprised within the scope of the present invention. Examples of such systems are described in more detail below, with reference to FIGS. 9-12.

A processor 36 operates the elements of apparatus 10. Objective optics 20 direct and focus beam array 16 onto a plane surface 22 of an element 24, producing an array of focused spots 27 on the plane surface of the element. Element 24 is mounted on a motion stage 38. Surface 22 is assumed to contain orthogonal x and y axes, and to be normal to a z axis. Processor 36 is able to operate stage 38 so that surface 22 translates in an x and/or a y direction, or in any combination of these directions. Thus stage 38 operates to translate surface 22 relative to optics 20. It will be appreciated that this relative translation may also be implemented by moving optics 20, using a motion stage generally similar to stage 38 coupled to the optics. Herein, by way of example, it is assumed that the relative translation between surface 22 and optics 20 is provided by operating stage 38 as a translator.

Optics 20 comprise a scanner 28, which is operated by processor 36 to scan focused spots 27 across surface 22. Systems corresponding to optics 20 and scanner 28 are well known in the art, the latter typically comprising a mirror and/or acousto-optic elements for diverting beams 16. Thus optics 20 and scanner 28 may be implemented from substantially any elements which are able to focus beams 16 onto respective portions of surface 22, and to scan the focused beams across the surface. Objective optics 20 may typically also be at least partly utilized as imaging optics 32, the imaging optics forming images of respective spots 27 onto a detector array 34. Signals from array 34 are processed by processor 36.

While scanner 28 is typically able to scan beam array 16 so that spots array 27 moves across relatively large areas of surface 22, in practice there is an aberration-corrected region 30 within which the focused spots may be brought to an optimal focus, and outside of which the focused spots are less than optimally focused. Region 30 is typically an approximately oval or circular region, approximately symmetrically located with respect to an optical axis 29 of apparatus 10. Region 30 is herein assumed to be contained within an approximately square region of side 500 μm. In embodiments of the present invention, scanner 28 maintains focused spot array 27 substantially within aberration-corrected region 30.

Apparatus 10 may be used, inter alia, for inspection and/or for photolithography of wafers and/or reticles. Hereinbelow, by way of example, it is assumed that element 24 comprises a semiconductor wafer having a plurality of substantially identical dies 25, and that apparatus 10 is used to inspect surface 22 of the wafer.

The following description is generally drawn to embodiments wherein the beams are generated using a particular type of electromagnetic wave, using optical inspection apparatus 10. It will be appreciated that this is by way of example, and those skilled in the art will be able to adapt the description to spots produced by substantially any type of optical beam.

FIG. 1B is an enlargement of a section of FIG. 1A, showing region 30, array 27, and dies 25 in more detail, according to an embodiment of the present invention. As is described in more detail below, processor 36 operates stage 38 and scanner 28 of apparatus 10 so that a swathe 39 is scanned across surface 22, and typically so that other swathes 13 are also scanned across the surface.

FIGS. 2A and 2B are schematic diagrams illustrating focused spots 27 in region 30, according to an embodiment of the present invention. In this embodiment, spots 27 are formed as 59 columns of 26 rows of spots, so that there are a total of 1534 spots, formed from a corresponding 1534 beams generated by multiplexer 14. The columns of spots define a beam array axis, and may be set to be parallel to the y axis. Each of the spots is assumed to be equidistant from its nearest neighbors, the distance being 10 μm. It will be appreciated that the above numerical values and orientation for spots 27 are provided by way of example, and that embodiments of the present invention may use substantially any convenient number, separation, and orientation of the spots. By way of example, a different number of spots is used, in an array generally similar in form to that described herein with respect to FIGS. 2A and 2B, in an alternative embodiment described with reference to FIGS. 4-6.

Spots 27 are initially positioned to be in an upper portion 32 of region 30. In the description herein, unless otherwise stated, terms such as “upper,” “lower,” and “left-side,” when used in relation to spots 27, are used to clarify the description, and refer to positions within FIGS. 2A and 2B and other figures herein, not to an aspect of apparatus 10. In operating apparatus 10, surface 22 (FIGS. 1A and 1B) is moved at a constant velocity V_(stage) in the y direction, so that the surface translates relative to axis 29. During a first part of scanning of spot array 27, the array moves with two components which are added vectorially: a component V_(stage), and a raster scan 80 component. An enlarged section I of raster scan 80 is shown in FIG. 2B, and in enlarged section I, individual spots of array 27 are labeled A1, A2, . . . , B1, B2, . . . , where A, B, . . . are column labels, and 1, 2, . . . are row labels. Enlarged section I illustrates the raster pattern of spots 27 due to raster scan 80 alone, and corresponds to the motion of the spots as seen by an imaginary observer on moving surface 22. Processor 36 sets the raster scan so that each spot of array 27 scans a substantially similar 10 μm×8.66 μm rectangle 82, by scanning a predetermined number of rows within the rectangle that are perpendicular to the y-direction. As described further below, the actual number of rows scanned depends on a size of scanned pixels in rectangle 82

Rectangles 82, generated by the raster scan, are contiguous and are arranged so that there are no gaps between rectangles. At the conclusion of the raster scan, spot A1 will be at a point 84, and an approximately rectangular region 86 of surface 22, shown in more detail in section II of FIG. 2B, will have been scanned by array 27. Region 86 is enclosed by two vertical sides 90 having lengths of 26×10 μm=260 μm, and two horizontal crenellated sides 88 and 92, having lengths of 59×0.866×10 μm=511 μm. Thus region 86 has approximate dimensions of: FOV _(y) ×FOV _(x)=260 μm×511 μm  (1)

-   -   where         -   FOV_(y) is the y Field of View value of the scanned region,             and         -   FOV_(x) is the x Field of View value of the scanned region.

It will be understood that so long as the array is moved with the same component V_(stage) as the stage, the dimensions FOV_(y), FOV_(x) of region 86 do not depend on the value of V_(stage).

Depending on the value of V_(stage), the position of array 27 will have moved from its initial upper portion 32. (If V_(stage)=0, the only movement of the array is because of the raster scan, and the position of array 27 remains substantially in upper portion 32.) Processor 36 sets the value of V_(stage) so that at the conclusion of the raster scan, bottom line 88 of region 86 is approximately coincident with a bottom line 31 of region 30 (FIG. 2A). In other words, during one scan, processor 36 moves array 27 from upper portion 32 to a lower portion 33 of region 30. As described in more detail below with reference to FIG. 3, at conclusion of the raster scan, processor 36 repositions array 27 back to its initial position in upper portion 32, and the processor then repeats the raster scan, while continuing to move the array and surface 22 with velocities V_(stage). The process of moving array 27 with a combination of raster scanning and velocity V_(stage) while moving surface 22 with velocity V_(stage), then repositioning the array back to its initial position, continues iteratively until a complete swathe 39 (FIG. 1B) has been scanned on surface 22.

The speed of the repositioning described herein, compared to the velocity V_(stage), is typically extremely high, so that the repositioning is essentially instantaneous. Typically, the time taken to reposition the array is less than 1% of the time for the array to scan. The time for the array to scan, T, is described in relation to FIG. 3 and also in relation to expression (3) below. Herein the term “jump” is also used to describe the repositioning that is implemented.

FIG. 3 shows a schematic “snapshot” 100 after a number of the scans of swathe 39 have been performed, as described above, according to an embodiment of the present invention. By way of example, four scans of substantially identical regions 86 of swathe 39 are assumed to have completed, and are shown in FIG. 3 as scans A, B, C, and D. Scan A is the first scan, and the most recent scan, scan D, is assumed to have just completed, so that apparatus 10 is about to begin a scan E. The position of region 30 is also shown in snapshot 100. After each scan A, B, . . . , processor 36 repositions the array in the negative y direction, by an amount that typically makes adjacent rectangles 86 contiguous with each other. In some embodiments of the present invention, processor 36 repositions the array so that there is a small overlap between adjacent rectangles 86.

Regardless of whether or not there is overlap between adjacent rectangles, a graph 107 of the y-position of array 27 vs. time is a saw-tooth graph, with the vertical lines 109 of the graph corresponding to the jumps applied to the array. T corresponds to the time for processor 36 to perform one scan of array 27, time T being equal to the time taken to scan one rectangle 82 (FIGS. 2A and 2B).

Typically, the value of T depends on the time taken by processor 36 to scan an individual pixel on surface 22, and on the number of pixels scanned by each spot of array 27. The latter also affects the number of rows scanned within each rectangle 82. Thus, if an individual pixel is square with a side p μm, the number of pixels N_(pix), and the number of rows R, scanned within a specific rectangle 82, are given by expression (2):

$\begin{matrix} \begin{matrix} {N_{pix} = \frac{86.6}{p^{2}}} \\ {R = \frac{10}{p}} \end{matrix} & (2) \end{matrix}$

If processor 36 takes a time T_(p) to scan a pixel, then the time T that the processor needs to scan a complete rectangle 82 is given by expression (3):

$\begin{matrix} \begin{matrix} {T = {N_{pix} \cdot T_{p}}} \\ {= {\frac{86.6}{p^{2}} \cdot T_{p}}} \end{matrix} & (3) \end{matrix}$

Processor 36 uses the value of T from expression (3) to adjust the value of V_(stage) so that scans such as scan A and scan B (FIG. 3) are contiguous or slightly overlapping. After each time T, processor 36 repositions array 27 to illuminate a new region of surface 22. Using the values for array 27 given above, in the case of no overlap, processor 36 repositions array 27 260 μm in the negative y-direction after each scan has completed. The value of 260 μm corresponds to the y height of the 26 rows scanned by array 27. For an overlap between scans of 10 μm, corresponding to the height of one rectangle 82, processor 36 repositions the array by 250 μm in the negative y-direction after each scan has completed. In either case, the value of V_(stage) is given by expression (4):

$\begin{matrix} {V_{stage} = \frac{{FOV}_{y}}{T}} & (4) \end{matrix}$

To account for a reposition time, i.e., a jump time, such as is described above, the value of V_(stage) given by expression (4) (and also that given by expression (8) below) may be reduced correspondingly, as will be apparent to those skilled in the art.

It will be appreciated that during the whole time that array 27 is operated, and as is illustrated in FIG. 3 for scans D and E, the focused beam of the array remains within region 30.

It will also be appreciated that swathe 39 is formed of contiguous or slightly overlapping regions 86, each one of which scans a portion of surface 22 with substantially no un-scanned parts within the regions. To scan the whole of surface 22, apparatus 10 typically scans multiple swathes, each swathe either being contiguous with each other, or slightly overlapping. For example, one or more second swathes 13 (FIG. 1B), adjacent to swathe 39, may be 511 μm from swathe 39, measured along the x-axis, giving no overlap, or may be 502 μm along the x-axis from swathe 39, giving an overlap of one rectangle 82.

It will be appreciated that, in contrast with other scanning processes known in the art, the relative beam positions of the spots generated by array 27 in the process described herein with reference to FIGS. 2A, 2B and 3 do not need to be very accurate. In particular, the process described herein is completely unaffected by any misalignment between the direction of travel of stage 38 and the direction of the columns of array 27.

As stated above, embodiments of the present invention described with reference to FIGS. 2A, 2B, and 3 provide a set of regions 86, each of the regions having substantially no un-scanned parts. After each region 86 is scanned, array 27 is repositioned. Swathe 39 is then formed by “joining” adjacent regions 86 together. In an alternative embodiment described below with reference to FIGS. 4-6, the same general pattern of scanning, repositioning the array in the negative y-direction, then scanning, is followed. Also, as in the embodiments described above, surface 22 and array 27 are both moved with a y-velocity V_(stage), the array also having a scan imposed on its V_(stage) motion. However, in contrast to the system described above, each specific scan of the alternative embodiment does have un-scanned regions. As is described below, the scans with un-scanned regions are combined together, and the combination eliminates the un-scanned regions.

FIG. 4 is a schematic diagram illustrating focused spots 27 in region 30, according to an alternative embodiment of the present invention. In the alternative embodiment, spots 27 are formed as 59 columns of 48 rows of spots, so that there are a total of 2832 spots, formed from a corresponding 2832 beams generated by multiplexer 14. The columns of spots are set to be parallel to the y axis. Each of the spots is assumed to be equidistant to its nearest neighbors, the distance being 10 μm, so that each spot defines contiguous rectangles 82 having dimensions 10 μm×8.66 μm, as described above with reference to FIGS. 2A and 2B. As shown in FIG. 4, unlike the embodiment described with reference to FIGS. 2A, 2B, and 3, the 2832 spots substantially completely fill region 30. Using the definitions for FOV_(y) and FOV_(x) given above in expression (1), the values for FOV_(y) and FOV_(x) are respectively 480 μm and 511 μm.

If the scanning system described above with reference to FIGS. 2A, 2B, and 3 is applied to the array of FIG. 4, i.e., if surface 22 is moved with a velocity V_(stage) and array 27 moves for a time T with a combination of V_(stage) and a raster scan, at the end of each scan approximately half of the array will be outside region 30.

To avoid this situation, some embodiments of the preset invention may use smaller reposition values, or jumps than the size of the jumps described above with reference to FIGS. 2A, 2B, and 3. These smaller jumps are referred to herein as “sub-jumps,” and the time difference between sub-jumps is reduced from the time T described above. Each rectangle 82 is divided into a number of generally similar cells 110. A respective cell 110 of every rectangle 82 is scanned using substantially the same method as described above for the embodiment of FIG. 2, i.e., surface 22 is moved with velocity V_(stage), and array 27 is moved with a combination of the same velocity V_(stage) and a raster scan that scans the specific cell. At the conclusion of the cell raster scan, array 27 makes a sub-jump in the negative y-direction, and another respective cell 110 of every rectangle 82 is scanned. The process of cell scanning and making sub-jumps in the negative y-direction continues. During an initial phase of the process, only some of the cells of rectangles 82 are scanned. Once the initial phase has completed, all the cells of the rectangles are scanned, so that continuing the process after the initial phase ensures that further rectangles 82 are completely scanned. The process for using the sub-jumps is described in more detail hereinbelow.

In each rectangle 82 there are assumed to be an integral number J of cells 110. The value of J is related to the number M of rows of array 27 according to expression (5):

$\begin{matrix} {J = \frac{M}{I}} & (5) \end{matrix}$

-   -   where I is a positive integer less than M.

The number of pixels N_(pixc) in each cell 110, and the time T_(c) needed to scan each cell, are given by expressions (6) and (7), respectively derived from expressions (2) and (3) above.

$\begin{matrix} {N_{pixc} = \frac{86.6}{J \cdot p^{2}}} & (6) \\ \begin{matrix} {T_{c} = {N_{pixc} \cdot T_{p}}} \\ {= {\frac{86.6}{J \cdot p^{2}} \cdot T_{p}}} \end{matrix} & (7) \end{matrix}$

The relation between FOV_(y) and V_(stage) is given by expression (8), derived from expression (4) above.

$\begin{matrix} {V_{stage} = \frac{{FOV}_{y}}{J \cdot T_{c}}} & (8) \end{matrix}$

After each time T_(c), i.e., after a cell 110 has been scanned, array 27 makes a sub-jump in the negative y-direction given by expression (9):

$\begin{matrix} {{Subjump} = \frac{{FOV}_{y}}{J}} & (9) \end{matrix}$

By way of example, I is assumed to be 3, so that from expression (5) J=16, and each rectangle 82 is divided into 16 respective cells 110, identified herein as cells 110 ₁, 110 ₂, . . . 110 ₁₆, as shown in FIG. 4. Herein cells 110 of rectangle 82 are assumed to be formed as a 4×4 array, but it will be appreciated that any other convenient array of 16 cells, such as a 2×8 array, may be used. As stated above, the value of FOV_(y) is 480 μm, so that from expression (9) the value of the sub-jump is 30 μm.

In the initial scanning phase a first set of cells, e.g. cells 110 ₁, is scanned, taking a time T_(c) for the scan, after which array 27 makes the sub-jump, and a different set of cells, e.g., cells 110 ₃, is scanned. The process of scanning then sub-jumping continues. In the initial scanning phase some of cells 110 of rectangles 82 are not scanned; once the initial scanning phase has completed, all cells 110 of every rectangle 82 are scanned.

FIG. 5 is a diagram showing a snapshot 120 of cells 110 scanned by array 27, according to the alternative embodiment of the present invention. Snapshot 120 is of the left-hand set of rectangles 82 of array 27 in FIG. 4, and is shown after 15 sub-jumps have been made, so that 16 scans of array 27 have been made, a 16^(th) sub-jump is about to be made, and a 17^(th) scan is about to start. Snapshot 120 also shows a lower left margin of region 30. In the example described herein, cells that have been scanned during a first scan are labeled with “1”, cells scanned during a second scan are labeled with “2”, . . . , and cells scanned during a 16^(th) scan are labeled with “16.” Other scanned cells are labeled with “/,” and cells that have not been scanned are empty.

Snapshot 120 illustrates that after a first scan of cells 110 ₁, a sub-jump of 30 μm in the negative y-direction is made, and a second scan of cells 110 ₃ is made. For the 16^(th) scan, cells 110 ₂ are scanned, and once this scan has completed, all cells of the three sets of rectangles 82 closest to the lower edge of region 30, labeled 82 ₁, have been scanned. For the 17^(th) scan, the process begins to repeat, i.e., un-scanned cells 110 ₁ are scanned, and during this scan, the V_(stage) component applied to array 27 moves the three sets of rectangles 82 ₂ to the lower edge of region 30. At completion of the 17^(th) scan rectangles 82 ₂ will also have been completely scanned. Similarly, at completion of the 18^(th) scan rectangles 82 ₃ will be closest to the lower edge of region 30, and will have been completely scanned.

FIG. 6 is a diagram 130 showing parameters of scanned rectangles 82 after 21 scans have been made, according to the alternative embodiment of the present invention. After each scan a sub-jump equal to the y-height of three rectangles 82 is made. In diagram 130, a column 132 refers to the set of rectangles 82 ₁, . . . by the subscript of the rectangles, and it will be understood that each set comprises three rectangles 82. In diagram 130 it is assumed that in the first scan cells 110 ₁ are scanned, in the second scan cells 110 ₂ are scanned, . . . , and in the 16^(th) scan, cells 110 ₁₆ are scanned. A region 134 of the diagram shows cells scanned during each scan; in the region the cells are referred to by their subscripts.

As shown by a column 136, and as has also been described above with reference to FIG. 5, after an initial scanning phase of 15 scans, all 16 cells of rectangles 82 begin to be scanned. Thus, for scans 16-21, complete rectangles 82 are scanned.

It will be appreciated that the process described above with reference to FIGS. 4-6 may be applied to completely scan a swathe 39 of surface 22, substantially regardless of the length of the swathe. Typically, in order to completely scan the beginning of the swathe, the initial scanning phase is started before region 30 completely covers surface 22, so that complete scanning of rectangles 82 begins as region 30 does completely cover surface 22.

It will also be appreciated that for the process described with reference to FIGS. 4-6 the relative beam positions of the spots generated by array 27 do not need to be very accurate, and that any inaccuracies may be taken care of by including a small overlap between cells 110. Furthermore, because of the small sub-jump size, the value of FOV_(y) for array 27 may be set to be nearly equal to the y-value of aberration-corrected region 30.

FIG. 7 is a diagram 150 showing a relation between the scanned FOV_(y) and the dimensions of dies 25, according to an embodiment of the present invention. Typically, dies 25 of surface 22 (FIG. 1B) are rectangular with substantially identical external dimensions. In a number of applications of the scans described herein it will be appreciated that it is advantageous that identical locations within respective dies are scanned by the same spot of array 27, and that the locations when scanned have the same offset with respect to optical axis 29. This condition is herein termed “die periodicity.” It will be understood that maintaining die periodicity facilitates die-to-die comparisons; other applications where maintaining die periodicity is advantageous will be apparent to those skilled in the art. If the y-dimension of each die 25 is D_(y), then identical locations of dies will be scanned by the same spot of array 27, i.e., die periodicity holds, when an integral number of scans exactly fill the die, corresponding to expression (10) being true:

$\begin{matrix} {{FOV}_{y} = \frac{D_{y}}{I}} & (10) \end{matrix}$

-   -   where I is a positive integer.

For example, if D_(y) is 15.4 mm (15,400 μm), and FOV_(y) is 480 μm, as in the example above, then expression (10) is not true, as is illustrated by section 152 of diagram 150. In section 152 each die 25 has an unscanned region 156, and die periodicity does not hold. To cause die periodicity to hold, a relatively small change of magnification of array 27 may be made, changing the value of FOV_(y) so that expression (10) is true. For example, if FOV_(y) is changed to 481.25 μm, then

${481.25 = \frac{15,400}{32}},$ and die periodicity holds, as is illustrated by section 154 of the diagram.

FIG. 8 is a diagram 170 illustrating an alternative method for maintaining die periodicity, according to an embodiment of the present invention. In order to maintain die periodicity for the embodiment described above with respect to FIGS. 4-6, processor 36 applies two different sized sub-jumps to array 27, as shown in a row 174, wherein “A” indicates a sub-jump of one size and “B” indicates a sub-jump of a different size. Processor 36 also adjusts the velocity V_(stage) of stage 38 so that the constant distance Y_(stage) traveled by stage 38 during each of the sub-jumps lies between the two sub-jump sizes. The distances Y_(stage) traveled by the stage are shown in a row 176, and processor 36 adjusts their number so that they exactly total a die size D_(y), shown in row 172. Processor 36 also adjusts the numbers of the two sub-jumps, A, B, so that the total distance traversed by the sub-jumps is exactly equal to D_(y).

The following description details one method for selecting the size and number of the sub-jumps, and the distance traveled by the stage during each sub-jump, in order to maintain die periodicity. Other methods will be apparent to those having ordinary skill in the art, and all such methods are assumed to be included within the scope of the present invention.

The description uses the following terms:

-   -   D_(y), die size in the y-direction;     -   Nby, the number of rows of spots in array 27;     -   pitch, the distance between spots;     -   Ncell, the number of cells per rectangle 82; and     -   J, the number of rectangles 82 in the sub-jumps of the         embodiment described above with respect to FIGS. 4-6.

J, Nby, and Ncell are related by the expression: Nby=J·Ncell  (11)

Values of non-negative integers D, E, and F, are selected so that the following expressions are true: D _(y) =D·pitch+d (0≦d≦pitch) D=E·Nby−F(0≦F<NBy)  (12)

Processor 36 implements N_(J) sub-jumps of size Y_(J) and N_(J−1) sub-jumps of size Y_(J−1), where:

$\begin{matrix} \begin{matrix} {{N_{J} = \left( {{E \cdot {Ncell}} - F} \right)},} \\ {Y_{J} = \frac{J \cdot D_{y}}{D}} \\ {{N_{J - 1} = F},} \\ {Y_{{J - 1}\;} = \frac{\left( {J - 1} \right) \cdot D_{y}}{D}} \end{matrix} & (13) \end{matrix}$

After each sub-jump, processor 36 sets array 27 to scan a different cell, from 1 to Ncell, of rectangles 82, and repeats the scanning sequence.

From expressions (13), the total number of sub-jumps, N_(J)+N_(J−1), is E·Ncell. Since E is an integer, this is an integral multiple of Ncell, so that each period begins on the same cell as the previous period.

Processor 36 sets the velocity of stage 38, V_(stage), so that between sub-jumps, i.e., during the scanning of each cell, the stage moves in the y-direction by the value given by expression (14) below. Since processor 36 moves array 27 with the same velocity V_(stage) during scanning of the array, array 27 moves in the y-direction by the same amount.

$\begin{matrix} {Y_{stage} = \frac{D_{y}}{\left( {N_{J} + N_{J - 1}} \right)}} & (14) \end{matrix}$

Using expressions (13) above, the total distance traversed in one period for all the sub-jumps is given by: N_(J)·Y_(J)+N_(J−1)·Y_(J−1)  (15)

Expression (15) can be simplified, using expressions (11), (12) and (13) as follows:

$\begin{matrix} \begin{matrix} {{{N_{J} \cdot Y_{J}} + {N_{J - 1} \cdot Y_{J - 1}}} = {\left( {{N_{J} \cdot J} + {N_{J - 1} \cdot \left( {J - 1} \right)}} \right) \cdot \frac{D_{y}}{D}}} \\ {= {\left( {{\left( {{E \cdot {Ncell}} - F} \right) \cdot J} + {F\left( {J - 1} \right)}} \right) \cdot \frac{D_{y}}{D}}} \\ {= {\left( {{E \cdot J \cdot {Ncell}} - F} \right) \cdot \frac{D_{y}}{D}}} \\ {= {\left( {{E \cdot {Nby}} - F} \right)\frac{D_{y}}{D}}} \\ {= D_{y}} \end{matrix} & (16) \end{matrix}$

Expression (16) shows that the total distance traversed by all the sub-jumps is D_(y); since, as described above, each period begins on the same cell as the previous period, die periodicity is maintained.

Below are numerical examples illustrating the expressions (11)-(16). The following values are assumed:

-   -   D_(y)=15.4 mm=15,400 μm;     -   Nby=48;     -   Pitch=10 μm;     -   Ncell=16; and     -   J=3.

Expressions (12) are satisfied by the values: {d, D, E, F}={0, 1540, 33, 44} and {d, D, E, F}={10, 1539, 33, 45}; these values are respectively referred to below as Example I and Example II.

Expressions (13) generate the following values:

-   -   Example I: N_(J)=484, Y_(J)=30 μm;     -   N_(J−1)=44, Y_(J−1)=20 μm.     -   Example II: N_(J)=483, Y_(J)=30.019 μm;     -   N_(J−1)=45, Y_(J−1)=20.013 μm.

For both examples, the distance Y_(stage) given by expression (14) is Y_(stage)=29.167 μm.

Both examples illustrate that by using two sub-jumps, one having a value greater than Y_(stage), the other having a value less than Y_(stage), die periodicity may be maintained.

In performing a scan of surface 22, the two sub-jumps calculated above may be interspersed in any convenient manner. Typically, the interspersion is performed so that array 27 deviates as little as possible from axis 29 of apparatus 10. Methods for calculating such interspersion are well known in the art; following are two Matlab functions that may be used to calculate a satisfactory interspersion:

function [NJ, NJ1, Ystage, YJ, YJ1, overhead] = APBS_algo(Dy, pitch, J, Ncell); D = fix(Dy / pitch); d = Dy − D*pitch; NBy = J*Ncell; E = 1 + fix((D−1)/NBy); F = E*NBy − D; NJ = E*Ncell−F; YJ = J* Dy./D; NJ1 = F; YJ1 = (J−1)*Dy./D; N = NJ + NJ1; % total number of jumps Ystage = die ./ N; overhead = (J−1)*d./die + pitch*F./die; return function [Jump] = APBS_jump(Dy, pitch, J, Ncell); % Jump = output vector of optimal jump sequence % (example: [J J J−1 J . . . ]′ ) [NJ, NJ1, Ystage, YJ, YJ1, overhead] = APBS_algo(Dy, pitch, J, Ncell); N = NJ + NJ1; dY = YJ − Ystage; dY1 = YJ1 − Ystage; Jump = [J]; Offset = dY; for i=2:N, Offset_J = Offset + dY; Offset_J1 = Offset + dY1; if abs(Offset_J)< abs(Offset_J1), Jump = [Jump; J]; Offset = Offset_J; else Jump = [Jump; J−1]; Offset = Offset_J1; end end return

It will be appreciated that while the description above is generally directed to multiple scanning beam apparatus used for inspection, the scope of the present invention comprises other types of scanning beam apparatus, such as is used for lithography. It will also be appreciated that embodiments of the present invention significantly relax tolerances on beam positions for optical beams, since there is no relationship between the direction of motion of the scanning stage and the positions of the beams within their array. Furthermore, embodiments of the present invention significantly increase the size of the array that may be used, while enabling the array to be kept within a specific region, such as an aberration-free region. Also, embodiments of the present invention enable die periodicity to be maintained, without compromising on the advantages described above.

Returning to FIGS. 1A and 1B, optical inspection apparatus 10 uses a beam multiplexer 14, followed by a scanner 28, the two elements in combination providing an arrangement for generating an optical beam array, and for scanning the array as described above. FIGS. 9-12 below illustrate alternative arrangements for generating and scanning the beam array. In the following descriptions, elements indicated by the same reference numerals in FIGS. 1A, 1B, 9, 10, 11, and 12 are generally identical in construction and in operation.

FIG. 9 is a schematic illustration of an apparatus 200, according to an embodiment of the present invention. Apart from the differences described below, the operation of apparatus 200 is generally similar to that of apparatus 10 (FIGS. 1A and 1B). In apparatus 200, scanner 28 comprised in optics 20, and multiplexer 14, are replaced. Other elements of optics 20 are also replaced/altered as necessary to form objective optics 226, which have the same function as optics 20; the required replacements/alterations will be apparent to those having ordinary skill in the art. Laser 12 is followed by two acousto-optic deflectors (AODs) 212 and 214, which are driven at respective radio-frequencies RF1 and RF2 by processor 36. AODs 212 and 214 are configured to deflect the incoming source beam from laser 12 in two orthogonal directions. The AODs enable processor 36 to adjustably scan the substantially parallel source beam from laser 12 independently in a local x and y direction, assumed to be orthogonal to the axis of the beam. Thus, processor 36 is able to scan the beam from laser 12 in a raster scan pattern similar to that shown in raster scan 80 (FIG. 2B).

Optics 216 receive the deflected laser beam from the AODs, and act to direct a substantially parallel beam to a Dammann grating 218, which acts to multiplex the incoming parallel beam. Dammann gratings are known in the art, and are described in articles by Dammann et al. in Opt. Commun. 3, 312 (1971) and in Opt. Acta 24, 505 (1977). Optics 216 typically comprise two converging lenses having substantially the same focal length f, separated by a distance 2 f. The AODs are positioned at a focal plane of the first lens of optics 216, and grating 216 is positioned at a focal plane of the second lens of the optics. Dammann grating 218 is constructed to generate a multiplicity of beams 220, each beam corresponding to a diffraction order. The number and orientation of beams 220 is chosen to correspond to those described above (FIG. 2A and FIG. 4), or to any other convenient number and orientation. In some embodiments of the present invention, grating 218 comprises two or more separate Dammann gratings, arranged to generate the required number and orientation of beams 220.

One or more lenses 224 receive beams 220 and form the beams into a corresponding multiplicity of beams 222 which are approximately parallel to each other. Lenses 224, together with optics 226, direct beams 222 to form array of focused spots 27 on surface 22 of wafer 24. In operation of apparatus 200, spots 27 and wafer 24 are operative substantially as described above for apparatus 10.

FIG. 10 is a schematic illustration of an apparatus 300, according to an embodiment of the present invention. Apart from the differences described below, the operation of apparatus 300 is generally similar to that of apparatus 200 (FIGS. 1A, 1B, and 9).

In apparatus 300, instead of AODs 212 and 214, processor 36 operates a scanning mirror, herein assumed to be a plane mirror 302, which is configured to adjustably scan the beam from laser 12 independently along the local x and y directions. The remainder of the operation of apparatus 300 is substantially as described for apparatus 200.

FIG. 11 is a schematic illustration of an apparatus 400, according to an embodiment of the present invention. Apart from the differences described below, the operation of apparatus 400 is generally similar to that of apparatus 200 (FIGS. 1A, 1B, and 9).

In apparatus 400, one or more lenses 404 and an array 402 are used instead of Dammann grating 218. Array 402 and lenses 404 act as a multiplexer, generating beams 406 for one or more objective lenses 408. Array 402 comprises a regularly spaced two-dimensional array of refractive and/or diffractive elements 403. If array 402 comprises refractive elements, elements 403 are typically micro-lenses; if array 402 comprises diffractive elements, elements 403 are typically zone plate arrays. Elements 403 receive a generally parallel beam from optics 216, and focus the beam to a multiplicity of intermediate spots 405. Lenses 404 and 408 form spots 405 into beams 222, and together with optics 226 the lenses and the optics direct beams 222 to form the array of focused spots 27. The remainder of the operation of apparatus 400 is substantially as described for apparatus 200.

FIG. 12 is a schematic illustration of an apparatus 500, according to an embodiment of the present invention. Apart from the differences described below, the operation of apparatus 500 is generally similar to that of apparatus 400 (FIGS. 1A, 1B, and 11). In apparatus 500, instead of AODs 212 and 214, a scanning mirror system generally similar to that of apparatus 300 (FIG. 10) is used.

It will be appreciated that other combinations of elements described above with reference to FIGS. 9-12 may be used to generate and scan the multiplicity of beams required to produce array 27. For example, scanning may be accomplished using a combination of mirror scanning in one dimension, and AOD scanning in an orthogonal direction. Alternatively or additionally, the mirror scanning may be implemented using a polygonal mirror, rather than the plane scanning mirror exemplified above. Typically, the polygonal mirror rotates about a symmetry axis of the mirror, and scans in one direction. As is known in the art, scanning of a polygonal mirror may be combined with another scanning mechanism, such as a plane scanning mirror or AOD described above, to form a two-dimensional scanning system. The two-dimensional scanning system so formed generates the raster scan pattern referred to above.

It will thus be appreciated that the embodiments described above are cited by way of example, and that the present invention is not limited to what has been particularly shown and described hereinabove. Rather, the scope of the present invention includes both combinations and subcombinations of the various features described hereinabove, as well as variations and modifications thereof which would occur to persons skilled in the art upon reading the foregoing description and which are not disclosed in the prior art. 

1. An apparatus for scanning a surface, comprising: a beam generator for generating an array of optical beams, the beam generator comprising a light source which projects a source beam along a source axis; beam optics having an optical axis, said beam optics adapted to focus the array of optical beams to illuminate a region of the surface intercepted by the optical axis, wherein each optical beam illuminates a portion of a respective sub-region within the region; a translator for translating at least one of the array and the surface to cause a surface velocity relative to the optical axis in a first direction; and an array scanner, for scanning, during the translating, each of the optical beams over the respective sub-region in a second direction, different from the first direction, said scanning defining an array of raster patterns, each raster pattern covering a respective sub-region and comprising rows perpendicular to said first direction, wherein the array scanner comprises a deflector which deflects the source beam from the source axis and said scanning comprising scanning a predetermined number of said rows, and then jumping in a direction substantially opposite to the first direction, thereby repeating said raster pattern over a new region of the surface, and wherein the deflector comprises an acousto-optic deflector (AOD) adapted to adjustably deflect the source beam.
 2. The apparatus according to claim 1, wherein the AOD comprises two AODs configured to deflect the source beam in mutually orthogonal directions.
 3. An apparatus for scanning a surface, comprising: a beam generator for generating an array of optical beams, the beam generator comprising a light source which projects a source beam alone a source axis; beam optics having an optical axis, said beam optics adapted to focus the array of optical beams to illuminate a region of the surface intercepted by the optical axis, wherein each optical beam illuminates a portion of a respective sub-region within the region; a translator for translating at least one of the array and the surface to cause a surface velocity relative to the optical axis in a first direction; and an array scanner, for scanning, during the translating, each of the optical beams over the respective sub-region in a second direction, different from the first direction, said scanning defining an array of raster patterns, each raster pattern covering a respective sub-region and comprising rows perpendicular to said first direction, wherein the array scanner comprises a deflector which deflects the source beam from the source axis and said scanning comprising scanning a predetermined number of said rows, and then jumping in a direction substantially opposite to the first direction, thereby repeating said raster pattern over a new region of the surface, and wherein the deflector comprises a plane mirror adapted to variably deflect the source beam.
 4. The apparatus according to claim 3, wherein the plane mirror comprises one plane mirror which is adapted to rotate in a plurality of directions so as to deflect the source beam in two dimensions.
 5. An apparatus for scanning a surface, comprising: a beam generator for generating an array of optical beams, the beam generator comprising a light source which projects a source beam along a source axis; beam optics having an optical axis, said beam optics adapted to focus the array of optical beams to illuminate a region of the surface intercepted by the optical axis, wherein each optical beam illuminates a portion of a respective sub-region within the region; a translator for translating at least one of the array and the surface to cause a surface velocity relative to the optical axis in a first direction; and an array scanner, for scanning, during the translating, each of the optical beams over the respective sub-region in a second direction, different from the first direction, said scanning defining an array of raster patterns, each raster pattern covering a respective sub-region and comprising rows perpendicular to said first direction, wherein the array scanner comprises a deflector which deflects the source beam from the source axis and said scanning comprising scanning a predetermined number of said rows, and then jumping in a direction substantially opposite to the first direction, thereby repeating said raster pattern over a new region of the surface, and wherein the deflector comprises a polygonal plane mirror having an axis of symmetry, and wherein the polygonal plane mirror is adapted to rotate about said axis of symmetry.
 6. The apparatus according to claim 1, wherein the beam generator comprises a light source which projects a source beam to a multiplexer, and wherein the multiplexer is adapted to divide the source beam into the array of optical beams so as to have a predetermined spatial relationship with each other.
 7. The apparatus according to claim 6, wherein the multiplexer comprises at least one Dammann grating.
 8. The apparatus according to claim 6, wherein the multiplexer comprises an array of refractive elements.
 9. The apparatus according to claim 6, wherein the multiplexer comprises an array of diffractive elements.
 10. A method for scanning a surface, comprising: generating an array of optical beams from a light source which projects a source beam along a source axis; focusing the array of optical beams using optics having an optical axis, so as to illuminate a region of the surface intercepted by the optical axis, wherein each optical beam illuminates a portion of a respective sub-region within the region; translating at least one of the array of optical beams and the surface so as to cause a surface velocity relative to the optical axis in a first direction; and during said translating, scanning the array of optical beams over the region in a second direction, different from the first direction, by deflecting the source beam from the source axis using an acousto-optic deflector (AOD) to adjustably deflect the source beam, said scanning defining an array of raster patterns, each said raster pattern covering a respective sub-region and comprising rows perpendicular to said first direction, said scanning comprising: scanning a predetermined number of said rows, and then jumping in a direction substantially opposite to the first direction, thereby repeating said raster pattern over a new region of the surface.
 11. The method according to claim 10, wherein the AOD comprises two AODs configured to deflect the source beam in mutually orthogonal directions.
 12. A method for scanning a surface, comprising: generating an array of optical beams from a light source which projects a source beam along a source axis; focusing the array of optical beams using optics having an optical axis, so as to illuminate a region of the surface intercepted by the optical axis, wherein each optical beam illuminates a portion of a respective sub-region within the region; translating at least one of the array of optical beams and the surface so as to cause a surface velocity relative to the optical axis in a first direction; and during said translating, scanning the array of optical beams over the region in a second direction, different from the first direction, by deflecting the source beam from the source axis using a plane mirror to variably deflect the source beam, said scanning defining an array of raster patterns, each said raster pattern covering a respective sub-region and comprising rows perpendicular to said first direction, said scanning comprising: scanning a predetermined number of said rows, and then jumping in a direction substantially opposite to the first direction, thereby repeating said raster pattern over a new region of the surface.
 13. The method according to claim 12, wherein the plane mirror comprises one plane mirror which is adapted to rotate in a plurality of directions so as to deflect the source beam in two dimensions.
 14. A method for scanning a surface, comprising: generating an array of optical beams from a light source which projects a source beam along a source axis: focusing the array of optical beams using optics having an optical axis, so as to illuminate a region of the surface intercepted by the optical axis, wherein each optical beam illuminates a portion of a respective sub-region within the region; translating at least one of the array of optical beams and the surface so as to cause a surface velocity relative to the optical axis in a first direction; and during said translating, scanning the array of optical beams over the region in a second direction, different from the first direction, by deflecting the source beam from the source axis by rotating a polygonal plane mirror having an axis of symmetry about said axis of symmetry, said scanning defining an array of raster Patterns, each said raster pattern covering a respective sub-region and comprising rows perpendicular to said first direction, said scanning comprising: scanning a predetermined number of said rows, and then jumping in a direction substantially opposite to the first direction, thereby repeating said raster pattern over a new region of the surface.
 15. The method according to claim 10, and comprising generating the array f optical beams from a light source which projects a source beam to a multiplexer, and wherein the multiplexer is adapted to divide the source beam into the array of optical beams so as to have a predetermined spatial relationship with each other.
 16. The method according to claim 15, wherein the multiplexer comprises at least one Dammann grating.
 17. The method according to claim 15, wherein the multiplexer comprises an array of refractive elements.
 18. The method according to claim 15, wherein the multiplexer comprises an array of diffractive elements. 